AITopics | interface prediction

TowardsStableRepresentationsforProtein InterfacePrediction

Neural Information Processing SystemsFeb-16-2026, 08:14:57 GMT

This work focuses on protein interface prediction, which aims to determine whether a pair of residues from different proteins interact.

artificial intelligence, machine learning, representation, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Add feedback

End-to-End Learning on 3D Protein Structure for Interface Prediction

Raphael Townshend, Rishi Bedi, Patricia Suriana, Ron Dror

Neural Information Processing SystemsFeb-12-2026, 12:25:53 GMT

Toinvestigatethisproblem, wemine theProtein Data Bank (PDB) [1]toconstruct alargedataset of protein complex structures for which structures of the individual proteins on their own are not available.

artificial intelligence, arxiv, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Government > Regional Government (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

End-to-End Learning on 3D Protein Structure for Interface Prediction

Neural Information Processing SystemsDec-25-2025, 12:48:37 GMT

Despite an explosion in the number of experimentally determined, atomically detailed structures of biomolecules, many critical tasks in structural biology remain data-limited. Whether performance in such tasks can be improved by using large repositories of tangentially related structural data remains an open question. To address this question, we focused on a central problem in biology: predicting how proteins interact with one another--that is, which surfaces of one protein bind to those of another protein. We built a training dataset, the Database of Interacting Protein Structures (DIPS), that contains biases but is two orders of magnitude larger than those used previously. We found that these biases significantly degrade the performance of existing methods on gold-standard data. Hypothesizing that assumptions baked into the hand-crafted features on which these methods depend were the source of the problem, we developed the first end-to-end learning model for protein interface prediction, the Siamese Atomic Surfacelet Network (SASNet). Using only spatial coordinates and identities of atoms, SASNet outperforms state-of-the-art methods trained on gold-standard structural data, even when trained on only 3% of our new dataset.

end-to-end learning, name change, protein structure, (4 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.99)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

85be6406bc3b93649a12b4074100c00b-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 08:21:34 GMT

prediction, protein, representation, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report > Experimental Study (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
(2 more...)

Add feedback

End-to-End Learning on 3D Protein Structure for Interface Prediction

Raphael Townshend, Rishi Bedi, Patricia Suriana, Ron Dror

Neural Information Processing SystemsOct-2-2025, 22:48:30 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, protein, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.47)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Add feedback

Reviews: End-to-End Learning on 3D Protein Structure for Interface Prediction

Neural Information Processing SystemsJan-24-2025, 13:07:20 GMT

The authors propose the first end-to-end learning model for protein interface prediction, the Siamese Atomic Surfacelet Network (SASNet). The novelty of the method is that it only uses spatial coordinates and identities of atoms as inputs, instead of relying on hand-crafted features. The authors also introduce the Dataset of Interacting Protein Structures (DIPS) which increases the amount of binary protein interactions by two orders of magnitude over previously used datasets (DB5). The results outperform state-of-the-art methods when trained on the much larger DIPS dataset and are still comparable when trained on the DB5 dataset, showing robustness when trained on bound or unbound proteins. The paper is very well written and easy to follow.

end-to-end learning, interface prediction, protein structure, (10 more...)

Neural Information Processing Systems

Genre: Research Report (0.39)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.52)

Add feedback

End-to-End Learning on 3D Protein Structure for Interface Prediction

Neural Information Processing SystemsOct-10-2024, 05:56:22 GMT

Despite an explosion in the number of experimentally determined, atomically detailed structures of biomolecules, many critical tasks in structural biology remain data-limited. Whether performance in such tasks can be improved by using large repositories of tangentially related structural data remains an open question. To address this question, we focused on a central problem in biology: predicting how proteins interact with one another--that is, which surfaces of one protein bind to those of another protein. We built a training dataset, the Database of Interacting Protein Structures (DIPS), that contains biases but is two orders of magnitude larger than those used previously. We found that these biases significantly degrade the performance of existing methods on gold-standard data.

end-to-end learning, interface prediction, protein structure, (2 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.99)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

Add feedback

Revealing data leakage in protein interaction benchmarks

Bushuiev, Anton, Bushuiev, Roman, Sedlar, Jiri, Pluskal, Tomas, Damborsky, Jiri, Mazurenko, Stanislav, Sivic, Josef

arXiv.org Artificial IntelligenceApr-16-2024

In recent years, there has been remarkable progress in machine learning for protein-protein interactions. However, prior work has predominantly focused on improving learning algorithms, with less attention paid to evaluation strategies and data preparation. Here, we demonstrate that further development of machine learning methods may be hindered by the quality of existing train-test splits. Specifically, we find that commonly used splitting strategies for protein complexes, based on protein sequence or metadata similarity, introduce major data leakage. This may result in overoptimistic evaluation of generalization, as well as unfair benchmarking of the models, biased towards assessing their overfitting capacity rather than practical utility. To overcome the data leakage, we recommend constructing data splits based on 3D structural similarity of protein-protein interfaces and suggest corresponding algorithms. We believe that addressing the data leakage problem is critical for further progress in this research area. The vast majority of protein-protein interactions remain undiscovered.

interaction, interface, similarity, (13 more...)

arXiv.org Artificial Intelligence

2404.10457

Country: Europe > Czechia > South Moravian Region > Brno (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Deep Learning of High-Order Interactions for Protein Interface Prediction

Liu, Yi, Yuan, Hao, Cai, Lei, Ji, Shuiwang

arXiv.org Machine LearningJul-18-2020

Protein interactions are important in a broad range of biological processes. Traditionally, computational methods have been developed to automatically predict protein interface from hand-crafted features. Recent approaches employ deep neural networks and predict the interaction of each amino acid pair independently. However, these methods do not incorporate the important sequential information from amino acid chains and the high-order pairwise interactions. Intuitively, the prediction of an amino acid pair should depend on both their features and the information of other amino acid pairs. In this work, we propose to formulate the protein interface prediction as a 2D dense prediction problem. In addition, we propose a novel deep model to incorporate the sequential information and high-order pairwise interactions to perform interface predictions. We represent proteins as graphs and employ graph neural networks to learn node features. Then we propose the sequential modeling method to incorporate the sequential information and reorder the feature matrix. Next, we incorporate high-order pairwise interactions to generate a 3D tensor containing different pairwise interactions. Finally, we employ convolutional neural networks to perform 2D dense predictions. Experimental results on multiple benchmarks demonstrate that our proposed method can consistently improve the protein interface prediction performance.

artificial intelligence, information, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1145/3394486.3403110

2007.09334

Country:

North America > United States > Texas > Brazos County > College Station (0.14)
North America > United States > Washington > Whitman County > Pullman (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

End-to-End Learning on 3D Protein Structure for Interface Prediction

Townshend, Raphael, Bedi, Rishi, Suriana, Patricia, Dror, Ron

Neural Information Processing SystemsMar-19-2020, 03:04:04 GMT

Despite an explosion in the number of experimentally determined, atomically detailed structures of biomolecules, many critical tasks in structural biology remain data-limited. Whether performance in such tasks can be improved by using large repositories of tangentially related structural data remains an open question. To address this question, we focused on a central problem in biology: predicting how proteins interact with one another--that is, which surfaces of one protein bind to those of another protein. We built a training dataset, the Database of Interacting Protein Structures (DIPS), that contains biases but is two orders of magnitude larger than those used previously. We found that these biases significantly degrade the performance of existing methods on gold-standard data.

end-to-end learning, interface prediction, protein structure, (2 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.99)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.47)

Add feedback